MovieCuts: A New Dataset and Benchmark for Cut Type Recognition

Abstract

Understanding movies and their structural patterns is a crucial task in decoding the craft of video editing. While previous works have developed tools for general analysis, such as detecting characters or recognizing cinematography properties at the shot level, less effort has been devoted to understanding the most basic video edit, the Cut. This paper introduces the Cut type recognition task, which requires modeling multi-modal information. To ignite research in this new task, we construct a large-scale dataset called MovieCuts, which contains 173,967 video clips labeled with ten cut types defined by professionals in the movie industry. We benchmark a set of audio-visual approaches, including some dealing with the problem's multi-modal nature. Our best model achieves 47.7% mAP, which suggests that the task is challenging and that attaining highly accurate Cut type recognition is an open research problem. Advances in automatic Cut-type recognition can unleash new experiences in the video editing industry, such as movie analysis for education, video re-editing, virtual cinematography, machine-assisted trailer generation, and machine-assisted video editing, among others. Our data and code are publicly available: https://github.com/PardoAlejo/MovieCuts .

Similar resources

CORe50: a New Dataset and Benchmark for Continuous Object Recognition

Continuous/Lifelong learning of high-dimensional data streams is a challenging research problem. In fact, fully retraining models each time new data become available is infeasible, due to computational and storage issues, while naïve incremental strategies have been shown to suffer from catastrophic forgetting. In the context of real-world object recognition applications (e.g., robotic vision),...

A New Benchmark Dataset for Handwritten Character Recognition

The report presents a new dataset of more than 40,000 handwritten characters. The creation of the new dataset is motivated by the ceiling effect that hampers experiments on popular handwritten digits datasets, such as the MNIST dataset and the USPS dataset. Next to a character labeling, the dataset also contains labels for the 250 writers that wrote the handwritten character, which gives the d...

A New Type-II Fuzzy Logic Based Controller for Non-Linear Dynamical Systems with Application to 3-PSP Parallel Robot

Type-II fuzzy logic has shown its superiority over traditional fuzzy logic when dealing with uncertainty. Type-II fuzzy logic controllers are, however, newer and more promising approaches that have recently been applied to various fields due to their significant contribution, especially when noise (an important instance of uncertainty) emerges. During the design of type-I fuz...

MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and linking them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information pr...

Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark

Psychological research results have confirmed that people can have different emotional reactions to different visual stimuli. Several papers have been published on the problem of visual emotion analysis. In particular, attempts have been made to analyze and predict people’s emotional reaction towards images. To this end, different kinds of hand-tuned features are proposed. The results reported ...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2022

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-20071-7_39